RG

MLA & Load Balance

Dual Pipe & Cross-Node All-to-All Communication

FP8 Training

Multi-Token Prediction & Inference (Prefilling & Decoding)

Reinforcement Learning on the Base Model

Reinforcement Learning with Cold Start

3fs